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Context. Cosmic shear is considered one of the most powerful methods for studying the properties of Dark Energy in the Universe. As a 
standard method, the two-point correlation functions of the cosmic shear field are used as statistical measures for the shear field. 
, Aims. In order to separate the observed shear into E- and B-modes, the latter being most likely produced by remaining systematics in the data 
set and/or intrinsic alignment effects, several statistics have been defined before. Here we aim at a complete E-/B-mode decomposition of the 
cosmic shear information contained in the on a finite angular interval. 
^ Methods. We construct two sets of such E-/B-mode measures, namely Complete Orthogonal Sets of E-/B-mode Integrals (COSEBIs), 
O ' characterized by weight functions between the £ ± and the COSEBIs which are polynomials in i? or polynomials in lni?, respectively. 

Considering the likelihood in cosmological parameter space, constructed from the COSEBIs, we study their information content. 
ffi . Results. We show that the information grows with the number of COSEBI modes taken into account, and that an asymptotic limit is reached 
which defines the maximum available information in the E-mode component of the £ ± . We show that this limit is reached the earlier (i.e., 
for a smaller number of modes considered) the narrower the angular range is over which £ ± are measured, and it is reached much earlier for 
fNj ■ logarithmic weight functions. For example, for <f+ on the interval 1' < & < 400', the asymptotic limit for the parameter pair (fi m , cr 8 ) is reached 
^ ' for ~ 25 modes in the linear case, but already for 5 modes in the logarithmic case. The COSEBIs form a natural discrete set of quantities, 
which we suggest as method of choice in future cosmic shear likelihood analyses. 
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1. Introduction tional shear from the shear components most likely not due 

to lensing, in order to (i) have a measure for the impact of 

The shear field in weak lensing is caused by the tidal com- Qther effectfj besMes lensing (such as insufficient PSF correc . 

ponent of the gravitational field of the mass distribution be- don for ^ shape measurementSi or intrinsic alignme nt ef- 
£ tween us and a distant population of sources (see Mellier 1999; fects) Qn ±e observed shear field; and to isolate the lens . 
d Bartelmann & Schneider 2001 ; Refregier 2003; Schneider et al. ing shear and tQ compare h wkh ±Q expectation from cosmo . 
2006; Munshi et al. 2008, for recent reviews). If the shear, esti- logical modelfj Indeed almQSt ^ more recent cosmic shear 
mated from the image shapes of distant galaxies, is solely due suryeys perform such an E . /B . mode deC omposition of second- 
to gravitational lensing, then it should consist only of a 'gra- Qrder ^ measures (e _ fr> H oekstra et al. 2002; Jarvis et al. 
dient component', the so-called E-mode shear (see Crittenden 2003; Hetterscheidt et al . 2007; Fu et aL 2008) . 
et al. 2002; Schneider et al. 2002). B-modes (or curl compo- 
nents) cannot be generated by gravitational light deflection in The standard technique for this separation is the aperture 
leading order, and higher-order effects from lensing are ex- dispersion (Ml p (d)} and (m^(6»)^ (Schneider et al. 1998), which 
pected to be small, as seen in ray-tracing simulations through can be calculated in terms of the shear two-point correlation 
the cosmological density field (e.g., Jain et al. 2000; Hilbert functions (2PCFs) on a finite interval < # < 26. 
et al. 2009). Alternatively, one can construct E- and B-mode shear correla- 
Therefore, the splitting of the observered shear field into its tion functions (Crittenden et al. 2002), which, however, can be 
E- and B-modes is of great importance to isolate the gravita- calculated only if the shear correlation function^ is known for 
arbitrarily large separations. As was pointed out by Kilbinger 
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sion requires the knowledge of the shear correlation functions 
down to zero separation, together with the inability to measure 
the shape of image pairs with very small angular separation, 
leads to biases in the estimated values for the aperture disper- 
sions, in particular to an effective E-/B-mode mixing. 

For that reason, Schneider & Kilbinger (2007) - hereafter 
SK07 - developed a new second-order shear statistics, that can 
be calculated from the shear correlation functions £ ± on a finite 
interval i? m in < ft < # max and which provides a clean sepa- 
ration of E- and B-modes. In particular, SK07 derived general 
expressions for the relation between E-/B-mode second-order 
shear quantities and the shear 2PCFs. They considered one par- 
ticular example of such a relation, leading to the so-called the 
ring statistics, based solely on geometric considerations. Eifler 
et al. (2010) and Fu & Kilbinger (2010) - hereafter FK10 - 
have shown that, although the signal-to-noise at fixed angular 
scale is smaller for the ring statistics than for the aperture dis- 
persion, the correlation matrix between measurements at dif- 
ferent angular scales is considerably narrower in the case of 
the ring statistics, yielding that the information contents of the 
two measures are quite comparable. Applying the ring statis- 
tics to the same cosmic shear correlation functions as used by 
Fu et al. (2008) in their measurement from the Canada-France- 
Hawaii Telescope Legacy Survey, Eifler et al. (2010) obtained 
a clear signal, as well as a better localization of the remaining 
B-modes. 

In FK10, more general E-/B-mode measures have been 
considered, based on the general transformation derived in 
SK07. Specifically, FK10 have constructed E-mode quanti- 
ties which maximize the signal-to-noise for a given interval 
# m in < # < #max, or which maximize the figure of merit in 
parameter space, as obtained from considering the Fisher ma- 
trix. Both of the resulting E-mode statistics are by construction 
superior to the ring statistics, and also yield higher signal-to- 
noise, or a higher figure-of-merit, than the aperture dispersion. 

In this paper, we construct sets of E-/B-mode measures, 
E n and B„, based on shear correlation functions on a finite 
interval. In a well-defined sense, for a given angular interval 
# m in < # < #max, these second-order E-/B-mode measures 
form a complete set each, so that all E-B-separable informa- 
tion contained in the £+(#) is also contained in this complete 
set. With these complete sets of second-order shear measures, 
we propose a new approach to compare observed shear corre- 
lations with model predictions. Whereas all such comparisons 
done hitherto define a second-order shear measure as a function 
of angular scale [such as %±(&) or ^M 2 p (0)^], the choice of the 
grid points in the angular scale being arbitrary, the complete 
set of the E n are a 'natural' discrete set of quantities that can 
be used in a likelihood analysis. One can hope that a finite and 
possibly rather small number of the E„ contains most of the 
cosmological information, depending on the choice of the set. 

In Sect. 2 we summarize the general equations for E-/B- 
mode measures obtained from the two-point correlation func- 
tions of the shear field over a finite interval, and derive the 
covariance matrix for a set of such E-B-mode measures. We 
then construct in Sect. 3 two examples of Complete Orthogonal 
Sets of E-/B-mode Integrals (COSEBIs), one of them using 
weight functions which are polynomials in the others being 



polynomials in In ft. In the former case, explicit relations for 
the corresponding weight functions are obtained for any poly- 
nomial order, whereas in the logarithmic case the coefficients 
have to be obtained through a matrix inversion. In Sect. 4, we 
then investigate the information content of these COSEBIs, by 
calculating the likelihood of cosmological parameter combina- 
tions and the corresponding Fisher matrix for a fiducial cosmic 
shear survey, using the two COSEBIs constructed, as well as 
the original shear correlation functions. We conclude by dis- 
cussing the advantages of the COSEBIs over the other second- 
order shear measures that have been suggested in the literature. 
In Appendix B, we show how COSEBIs can be used to maxi- 
mize the signal-to-noise of a cosmic shear E-mode measure. In 
addition we show how to construct pure E/B-mode correlation 
functions from the COSEBIs and relate them to the 2PCF. 

2. E-/B-mode decomposition 

In SK07 we have shown than an E-/B-mode separation of 
second-order shear statistics is obtained from the 2PCFs £ ± by 

e = x - J d&& pv(#)£ + (#) + r_(0)f_(0)] , 

b = ±f d##[r + (#)f + (#)-r_(#)f_(#)] , (i) 

provided the two weight functions T ± are related through 

I d&#J (€&)T + (&)= I d00J 4 (£0)r_(0) (2) 
Jo Jo 

or, equivalently, 



r + (0) = r_(#) ■ 
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In this case, E contains only E-modes, whereas B depends only 
on the B-mode shear. Furthermore, it was shown in SK07 that 
an E-mode second-order statistics is obtained from the shear 
correlation functions on a finite interval # m i n < ft < # m ax if the 
function T+ vanishes outside the same interval, and in addition, 
the two conditions 



d§ ft T + (ft) = = d§& 3 T + (§) 

#mm Jl'min 



(4) 



are satisfied; in this case, the function T-(ff) as calculated from 
Eq. (3) also has finite support on the interval tf mm < ■& < # max . 
In SK07, a particular set of functions T± was introduced, orig- 
inating from the geometrical construction of cross-correlating 
the shear in two non-overlapping annuli, and the corresponding 
estimators were termed 'ring statistics'. 

The origin of the conditions expressed in Eq. (4) can be un- 
derstood as follows: A uniform shear field cannot be assigned 
an E- or B-mode origin. Such a shear field gives rise to shear 
correlation functions of the form £+(#) = const, and £-(#) = 0. 
According to the first of Eq. (4), this component is filtered 
out in Eq. (1). Furthermore, one possibility to distinguish be- 
tween E- and B-modes is the consideration of the vector field 
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m = (71,1 + 72,2, 72,1 - 71,2) constructed from partial derivatives 
of the shear field 7(1?) (Kaiser 1995). A pure E-mode shear 
yields a vanishing curl of u, whereas a pure B-mode shear leads 
to V ■ u — 0; a shear field which yields V ■ u =0 = curl(«) cannot 
be uniquely classified as E- or B-mode. 

If we now consider a shear field which depends linearly 
on §, then the vector field u is constant, and thus it cannot be 
uniquely split into E- and B-modes. On the other hand, such 
a shear field gives rise to correlation functions of the form 
- A + Bd 2 , = 0, where A and B are constants. 

Again, the correlation function of such a shear field is filtered 
out due to the conditions in Eq. (4). 

2.1. E-/B-modes from a set of functions 

Of course, there are many functions T + (§) which satisfy the 
constraints in Eq. (4). Assume we construct a set of functions 
T +n (ff) which all satisfy Eq. (4) and which are, in a way spec- 
ified later, orthogonal. Then one can construct the correspond- 
ing r_„(#) from Eq. (3), and thus one obtains the set E„ and B n 
of second-order shear measures with a clean E-/B-mode sep- 
aration. Each of the E„ and B„ measures an integral over the 
power spectrum of E- and B-modes, respectively, 



r 3 die 
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where the filter functions are 



w„(0 



d0 0T +B (0)Jo(fl?) , 



(5) 



(6) 



and where we made use of the relation between the shear cor- 
relation functions and the power spectra (see, e.g., Schneider 
et al. 2002) 
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We next calculate the covariance of the E- and B-mode mea- 
sures making use of Eq. (5), 



— (E m E n ) {E m ) {E n ) 

r°° d{ e r 
-=-w m (fi) 

Jo Hi Jo 

= -f 

nA Jo 



dl' (' 
2n 



W n (f)(APE(t)AP E (f)) 



dt ew m ({)WniO [Pe(0 + n £ y 



(8) 



where in the final step we have assumed a Gaussian shear field 
and used the corresponding expression for the covariance of 
the power spectrum from Joachimi et al. (2008). Here, A is the 
survey area, N 6 = <j^/(2h) is the amplitude of the white noise 
power spectrum resulting from the intrinsic ellipticity distribu- 
tion of sources, <x e is the dispersion of the intrinsic ellipticity, 
and h is the mean number density of sources. The covariance 
of the B„, Cf nn , has exactly the same form, with Pe replaced by 
Pb, and the covariance between the E„ and B,„ vanishes. 



As a consistency check, we calculate the covariance in a 
different form, starting from the relation between the E„ and 
the shear correlation functions. We then obtain 



C 



1 r*&msa /^rnax 

= - d&§ I d§' &' 



(9) 



/J,v=l+,-| 



where C ±± {§,§') is the covariance of the shear correlation 
function %±{ff). Using the relations of Joachimi et al. (2008) 
for the covariance of the f ± , assuming a Gaussian shear field, 
and making use of Eq. (2), the result (8) is re-obtained. 

The comparison of the £'° bs obtained from observations 
with those of a model E„{n), where n denotes a set of M model 
parameters, can then be done via 

X 2 = J [£° bs - E m (n)] {C E H [E^ - E„(n)] , (10) 

m,n=\ 

where is the maximum number of E-modes considered, or 
with a likelihood function 



£ = [(2nf /2 VdetC E J ' e 



-X 2 /2 



(ID 



2.2. Calculation of E-mode second-order statistics 
from ray-tracing simulations 

Due to the limited range of validity of analytic approxima- 
tions for the calculation of cosmic shear statistics, ray tracing 
through N-body simulated three-dimensional density distribu- 
tions are carried out (see, e.g., Jain et al. 2000; Hilbert et al. 
2009, and references therein). As shown in these papers, the 
resulting B-mode shear is several orders of magnitude smaller 
than the E-mode shear, so that the resulting shear field can 
be described very accurately in terms of an equivalent sur- 
face mass density k(0). It is often faster to derive statistical 
properties of the resulting shear field from the corresponding 
properties of the A:-field. For example, the aperture mass M ap 
(Schneider 1996) can be obtained from the shear field through 
a radial filter function Q, but also from the /(--field through a re- 
lated radial filter function U. Hence, one can calculate the field 
of M ap from the equivalent surface mass density, convolved 
with the filter U, and the aperture mass dispersion is then given 
as the dispersion of this field. In this way, no correlation func- 
tions of the shear need to be obtained for making predictions, 
saving computation time. 

Here we will show that, similar to the case of the aperture 
mass dispersion, the E-mode second-order shear statistics de- 
fined in Eq. (1) can be obtained from a simulated k- field, with- 
out the need to calculate the shear correlation functions. For 
that we note that, in the absence of B-modes, one has 
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and that the correlation functions of k and y agree, 
{k{0)k{0')) = {y{0)y\e')) = . 
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If we smooth the convergence field with a radial filter function 
F, obtaining 



KM 



j d 2 ff k(0' 



)F(\e-e'\) 



(12) 



the correlator of the smoothed field with the unsmoothed field 
at zero lag becomes 



Setting # = 0' - 0, we see that 

E = (k(0)k s W) , 



(13) 



(14) 



if we choose F{&) = {2ny y l '+(#)■ Hence, the calculation of 
E from simulations can proceed by convolving the /(-field with 
the function T + {ff)l{2n), and correlating the resulting field with 
the original /c-field, dropping a band of width # max along the 
boundaries of the field where the convolution via FFT causes 
artifacts. 

3. Complete sets of weight functions 

Here, we construct complete sets of functions which satisfy the 
constraints (4) for the weight function T + (i?) on the interval 
i?min < # < $max- It should be noted that, once a complete set 
of such functions is known, the maximization of the signal-to- 
noise of the second-order E-mode shear - a problem considered 
in FK10 - reduces to a linear algebra problem, as shown in 
Appendix B. 

Readers less interested in the explicit construction of these 
COSEBIs can go directly to Sect. 4. 

3.1. Polynomial weight functions 

First, we construct a complete set of weight functions which are 
polynomials in §. To do so, we transform the interval # m ; n < 
& < # m ax onto the unit interval - 1 < x < 1 , by defining 



2(# - #) 



(15) 



with § = (# min + # max )A A# = # max - #min- In addi- 
tion, we define the relative interval width B = A#/(2i?) = 

(#max - #min)/(#max + #min)- TllUS, as 1? Varies from # min tO 

#max, x goes from -1 to +1. Then we set T +n {§) = t +n {x), and 
r_„(#) = f_„(x). The t +n are chosen to be polynomials in x; 
as Eq. (15) is a linear transformation, the polynomial order is 
preserved. Furthermore, we require that the set of functions are 
orthonormal, i.e., 



x: 



dx t +n {x) t +m (x) = 6„ 



(16) 



The first two functions of the set are constructed 'by hand': 
The lowest-order polynomial which can satisfy the constraints 
(4) and the normalization constraint (16) is of second order. 
Hence, we choose t + \{x) to be a second-order polynomial, and 
determine its three coefficients from the three constraints. The 
lowest-order polynomial which can satisfy the two constraints 
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Fig. 1. The linear filter functions T+„(&) for i? m j„ = 1', i? max = 400'. 
Note that the shape of the curves depends only on the ratio & w 
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(4) and the orthonormality relation (16) for m = 1, 2 is of third 
order, and its four coefficients are determined accordingly; this 
yields 



t + i(x) = — L |3fi 2 - 5 - 6Bx + 3(5 - B 2 )x 2 ] , 



t +2 (x) = — L|# 3 (25 + 3B 2 ) - 15(35 + 9B 2 + SB 4 )x 

VX2 



- 15fi 3 (3 + B 2 )x 2 + 35(25 + 5B 2 + 6B 4 )x 3 ] , (17) 



with 



Xi = 8(25 + 5B 2 + 6B 4 )/5 



X 2 = 8(25 + 5B 2 + 6B 4 )(175 + 35B 2 + 45B 4 + B 6 ) 



(18) 



To obtain the higher-order functions of this set, we note that the 
Legendre polynomials P n {x) are orthogonal, and that 



dx P„(x) x m = for m < n 



This shows that the constraints (4), written in terms of x, are 
satisfied if we choose f+(x) ex P n (x) for all n > 4. Furthermore, 
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Fig. 2. The functions W„ as defined in Eq. (6) which relate the 
COSEBIs to the underlying power spectrum, calculated from the T ± „. 
The upper panel corresponds to ?? max = 400', whereas the lower panel 
is calculated using i? max = 20', both for & mm = V 

the P n (x) for « > 4 are orthogonal to t+i(x) and f+2(x), since 
the latter are polynomials of order < 3. Thus, choosing the nor- 
malization such as to satisfy Eq. (16), we find for n > 3, 



t+ n (x) 



2n + 3 



■P n +l(x) = p n +l(x) 



(19) 



In the upper panel of Fig. 1, we have plotted the filter function 
T +n (ff) for three values of n. For n > 3, they are simply pro- 
portional to the Legendre polynomials. Note that T +n {ff) has 
(n + 1) roots in the interval i? m j n < ■& < § m:ix , and the normal- 
ization is chosen such that r + „()? max ) > 0. The corresponding 
filter functions W n {() which relate the COSEBIs to the power 
spectrum Pe{£) are displayed in Fig. 2, for several values of n 
and for two different values of the relative width parameter B 
(corresponding to two different values of # ma x)- 

For this set of functions t +n (x), we can obtain the corre- 
sponding t- n (x) using Eq. (3), 



Ux) = t+(x) + 



45 



(1 +Bx) 2 



dy t+(y)G(y,x) 



(20) 



where 



G(y,x) = \+By-3 



(l + Byf 

(1 +Bx) 2 



2>^ 



(21) 
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and the coefficients A* are given explicitly as 
3 9B 



A = 1 

A 2 = 



(1 +Bx) 2 
-9B 2 



(1 +Bx) 2 ' 



A, = 



B 



-3B 3 



(1 +Bx) 2 



(1 +Bx) 2 



(22) 



For the first two functions, the integral is carried out explicitly, 
yielding 



t-i(x) 



1 



^JT x (\+Bx) A 

1 Yu 

1 + Bx) 4 fcj 



k=0 
1 



t-i(x) = — — > UitX" 

with the coefficients U 

-5 + 19B 2 - 15fi 4 + 3fl 6 , 



(23) 



U n 
Un 
U n 

1/20 

U22 
U23 
U 2A 

U 25 
U 26 
U21 



2B(1 + B - 3B 4 ) , 

15 + IB 2 + B 4 - 3B 6 , 

20B , U u = 10B 2 , U i5 = 2B 3 ; 

-fi(350 - 360B 2 + 182B 4 - 93B 6 + 21B 8 ) , 
-525 + 215B 2 - 30B 4 + 38B 6 + 18B 8 , 
fi 3 (130 + 30B 2 + 19B 4 + 9B 6 ) , 
5(175 + 105B 2 + 48B 4 + 12B 6 ) , 
5/3(350 + 105/3 2 + 87B 4 + 6B 6 ) , 
fl 2 (1400 + 315B 2 + 339B 4 + 6B 6 ) , 
21fi 3 (25 + 5B 2 + 6fi 4 ) , 
3B 4 (25 + 5fi 2 + 6B 4 ) . 



For n > 3, we first define 

/*(*):= £dyP n (yrf. 

For k — 0, one obtains 
,0/ n Pn+i(x) - P n -i(x) 



(24) 



(25) 



1 +2n 

whereas for k > 1, we make use of the recurrence relation 
for Legendre polynomials, (2n + l)yP n (y) — (n + l)P n+ i(y) + 
nP„-\{y), to find 



w = 



(n + l)lt}{x) + nlt\(x) 



(26) 



2« + 1 

Making use of Eqs. (20) and (21), we then find, for « > 3, 

\2n + 3 4fi t 
t-„(x) = t +n {x) + \\ — - „ , „_ a? 2jA^ +1 (x) . (27) 



2 (1 + Bx) 2 



/>=o 



For three different values of n and )? m ; n = 1', j9- max = 400', the 
functions T- n {ff) are displayed in the lower panel of Fig. 1. 
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3.2. Logarithmic weight functions 

Choosing the T+„ to be polynomials in § implies that the struc- 
ture of these weight functions is similar on all angular scales 
from § m i n to i? max . For example, the roots of the T + „ are fairly 
evenly spread on the interval -& m \ n < § < # max . On the other 
hand, we expect the correlation function to show more 

structure on small scales than on large scales. Hence, for a 
given maximum number N of modes, the large angular scales 
will be sampled on finer scales than needed, whereas small an- 
gular scales may not be sufficiently well resolved to extract all 
information contained in the correlation function. 

In order obtain a finer sampling of the small-scale correla- 
tion function for a given N, we now construct a set of weight 
functions which are polynomials in ln#. Hence the roots of 
these weight functions are approximately evenly spaced in In §, 
thus the weight functions sample small angular scales with 
higher resolution than large angular scales. As in Sect. 3.1, 
this set of functions must fulfill the constraints (4), and we re- 
quire the functions to be orthonormal. Hence, the lowest-order 
weight function again is of second-order. We parametrize this 
set of weight functions as 



n+l n+l 

j=o j=o 



where we choose 



z = ln(#/<W 



(28) 



(29) 



which varies from to z max = ln(i? max /i? m i n ) as & goes from 
i? m i n to # max . Furthermore, we defined c n j = N„c„j with N„ = 
c n (fi+i) + 0, so that c„(, 1+ i) = 1. In this way, the relative ampli- 
tude of the c's is decoupled from the overall normalization N„. 
As before, we set T+%(ff) = t l ^(z) and T l ^(&) = t^(z). With 
this transformation of variables the constraints (4) become 

P" dz z 2z # (z) = = (*" dz e 4 ^ & (z) , (30) 
Jo Jo 



and an orthonormality condition analogous to Eq. (16) can be 
written as 

^r*"d0r£(0rt(0) 



$min f m " , r.loe, -..log, r 

= J Q dz e z tj(z)t + *(z) = 5 n 



(31) 



To write these constraints in a more compact form we define 
the set of coefficients 



J(k 



Jo 



dz e** z j = 



1,-femax) 



(32) 



where y(a, x) is the incomplete Gamma function. 

With the representation (28), the constraints (30) become 



Y^c ni J{2J) = -7(2, n+l), 



7=0 



Y^CnjJi^D = -/(4,n+l) 
7=0 



(33) 



These two equations determine the two coefficients cio, c\\ 
needed to obtain (z). We then obtain the corresponding co- 
efficients c„j by iterating in n. Thus, for a given n, we assume 
that the c m j have been determined for all m < n. Then, the c n j 
are obtained from the two Eqs. (33), and the (« - 1) orthogo- 
nality conditions (31) for 1 < m < n — 1, which read in the 
representation (28) 

n+l m+l 

2^2^J(l,i + j) C mi C nj = , 

7=0 ,=0 



n (m+l \ m+l 

^ ^ J(l, i + j) c mi c nj = - ^ J(l , i + n + 1) c mi , (34) 

y=0 V (=0 / i=0 

where we used that c„(„ + i) = 1. Thus, together we have 77 + 1 
linear equations for the n + l unknown coefficients c n j, < 
j < n, which in principle can be readily solved (but see below). 
Finally, to obtain the normalization of the functions, we use 
Eq. (31) for m = «, which together with Eq. (28) yields 



N: 



1,7=0 



J(l,i + j) = t— = e w - 1 



(35) 



which determines A^,, (and thus the c„j = N n c n] ) up to an (ar- 
bitrary) sign. For definiteness, we choose the sign such that 
t+n fenax) > 0, implying that N„ = c„ ( „ +1) > 0. 

It turns out that the solution of the system of linear equa- 
tions for the c's requires very high numerical accuracy for even 
moderately large «, in particular for large values of # max /# m m. 
We used Mathematica (Wolfram 1991) with large setting of 
WorkingPrecision for calculating the incomplete Gamma 
function and for carrying out the sums in Eq. (34). Once the 
c's have been determined, the integrals in Eqs. (30) and (31) - 
the latter for m < n - have been calculated to check the accu- 
racy of the solution. We found that, for # max /# m i n = 400 and 
for n max = 20, one needs to determine the c's to 40 significant 
digits, in order for all these integrals, which should be zero, to 
attain values less than 0.1. We then calculated the n + l roots 
r„j of the f+,f (z), and represented the functions as 



t+!(z)=N n Y](z-r ni ) . 



(36) 



For the same parameters as before, using only five significant 
digits for the r's renders all the integrals zero to better than 
10~ 6 , and with eight significant digits, the integrals are zero 
to better than 10~ 17 even for n max = 40. Thus, the represen- 
tation (36) is the adequate one for practical work. A short 
Mathematica program for calculating the r n is displayed in 
Fig. 3. 

The corresponding ri°f are constructed from Eq. (3), by 
defining y = ln(#/# m j n ), which yields 



^(z) = d°„ g (z)+4 J o dyt l :°(y)(e 2 ^-3^) 

n+l „ z 

= ^(z)4Vc„ ; . dy We 2 ^-3e 4 <3->) 
Jo 



7=0 
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Nmax=20; tmin=l; tmax=400; zm=Log [Rationalize [tmax/tmin] ] 
gamm [a_ , z_] =Gamma [a , 8 , z] 

Do[J[k, j]=Re[N[gamm[j+l,-k zm]/(-k) " (j+1) , 138]] , {k, 1 , 2} , {j ,8 , 2 Nmax+1}] 

Do[J[4, j]=Re[N[gaimn[j+l,-4 zm]/(-4) " (j+1) , 138]] , {j ,0, 2 Nmax+1}] 

Do[ 

Do[a[n,j]=l[2,j]/J[2,n+l]; a[n+l , j]=J [4, j]/J [4,n+l] , { j ,8,n}] ; b[n]=-l; b[n+l]=-l; 

Do[a[m, j]=NSum[J [1 , i+j] c[m,i] ,{i,8,m+l}, WorkingPrecision->80 , NSuitiTerms->Nmax] , {m, 1 ,n-l} , {j ,8,n}] ; 
Do[bb[m]=-NSum[J [l,i+n+l] c [m, i] , {i , 8 ,m+l} , WorkingPrecision->80, NSumTerms->Nmax] , {m, 1 ,n-l}] ; 
Do[a[m, j]=a[m, j]/bb[m] , {m, 1 ,n-l} , { j , 0,n}] ; Do[b[m]=l , {m, 1 ,n-l}] ; 
A=Table[a[i, j] , {i , 1 ,n+l} , { j ,0,n}] ; B=Table[b[i] ,{i,l,n+l}] ; 
CC=LinearSolve[A,B] ; Do[c[n, j]=CC[[j+l]] ,{j ,0,n}] ; c[n,n+l]=l; 
tt[n,z_]=Simplify [Sum[c[n, j] z" j , {j ,0,n+l}]] ; 

roots=NSolve[tt[n,z]==8,z] ;Do[r[n, j]=N[roots[[j , 1,2]] ,8] ,{j , l,n+l}] ; 
t[n,z_]=Product[(z-r [n, j]) , {j , l,n+l}] ; 

normgral=NIntegrate[Exp[z] t[n,z] "2 , {z,8,zm} ,WorkingPrecision->50] ; 
norm[n]=Sqrt[(Exp[zm]-l)/normgral] ; t[n,z_]=t[n,z] norm[n] , 
{n, 1 ,Nmax}] 

ROOTS=Tab le[r[n,j],{n,l, Nmax } , { j , 1 , Nmax+ 1 } ] 



Fig. 3. Mathematica (Wolfram 1991) program to calculate the roots in Eq. (36) - they are stored with 8 significant digits in the lower left halve 
of the table ROOTS. Furthermore, the array norm[n] contains the normalization coefficients N„ 
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Fig. 4. The logarithmic filter func- 
tions T l °% for & mm = 1' and 
i?max = 400'. The left panel shows 
the function over the whole interval, 
whereas the right panel provides a 
more detailed view for small i? 
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Fig. 5. The logarithmic filter func- 
tions r!°„ g for # min = 1' and # max = 
400'. As in Fig. 4, the left panel 
shows the function over the whole 
interval, whereas the right panel 
provides a more detailed view for 
small & 



-»./ 



7=0 



(-2y +i 



y(j + l,-2z) 



(37) 



3e 



-2: 



2J+i 



■y(j+ l,-4z) 



given in the form (36). Alternatively, making use of the repre- 
sentation 



Given the remarks above, the first of these expressions (i.e., 
numerical integration) is the method of choice if the f+,f (z) are 



y{j + hz) = j\ 



■y- 

ml 

m-0 
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Fig. 6. The W„ -functions calculated from T^ 08 . The upper panel cor- 
responds to i? max = 400', whereas the lower panel is calculated using 



one can write the f lo ,f (z) as 

n 

t l °n(z) = a n 2& lz - a„AeT Az + ^ d„, 

m=0 

where the coefficients are given as 
, v c "j jl 

a„2 — 4 > r-r , 

Zj (-2)J +l 



(38) 



n+ 1 



-nj 



" 4 Zj C-4V+1 



-i^c„,;!(-2r-- 1 (3 2»-- 1 -l). 



(39) 



In Figs. 4 and 5, we have plotted the filter functions T± t f for 
f^min = 1' and # max = 400'. The left panels show these filter 
functions over the whole angular range, the right panels show 
an enlargement for small values of §. As expected, the roots 
of the weight functions are clustered towards lower values of 
Thus, for a fixed maximum number of n, these functions re- 
solve those scales better than the linear filter functions. Figure 
6 shows the filter functions W„{£) which, according to Eq. (5), 



10 20 50 
[arcmin] 



100 200 



500 



Fig. 7. The 2PCFs £+(i?) and the corresponding pure E-mode corre- 
lation functions for i? m j n = 1', i? max = 400', and the fiducial 
cosmological model described in Sect. 4 

relates the COSEBIs to the underlying power spectrum P^{€). 
With increasing n, the COSEBIs are sensitive to power at in- 
creasingly larger values of (. 

3.3. E-/B-mode correlation functions 

Crittenden et al. (2002) and Schneider et al. (2002) constructed 
E-/B-mode correlation functions, which consist of the original 
correlation function <f+(i?) plus a correction term which is again 
an integral over correlation functions. However, these correc- 
tion terms are unobservable, since the integral extends over an 
infinite angular range. Thus, these E-/B-mode correlation func- 
tions cannot be obtained in practice and are of little use. 

With the full E-/B-mode decomposition provided by the 
COSEBIs, we can define new pure E-/B-mode correlation func- 
tions, 



£±(#) = — J] E n TM , 



2 



(40) 



obviously, the £jj only depend on the E-mode shear, whereas 
the f± contains information only from B-modes. Owing to the 
constraints (4) which the functions T+„ have to obey, one finds 
that 



d# § 3 



(41) 



In fact, as shown in SK07, the function T also obeys analogous 
constraints, namely 

d# f v » d& 



so that 



J** ~® 



£(#) = = 



em 



(42) 
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Fig. 8. The correlation coefficients (44) for linear (top) and logarithmic 
(bottom) weight functions T+„, calculated for & mi „ = 1', i? max = 400', 
and the fiducial cosmological model described in the text 



In Fig. 7, we have plotted the pure E-mode correlation func- 
tions together with the orinial 2PCFs £ ± , for a fiducial 
ACDM cosmological model that will be described in the next 
section; the overall shape of these functions, however, does not 
depend on the details of the choice of cosmological parame- 
ters. Although not easily visible, both have two roots, as 
required by the constraints (41) and (42). The function is 
rather similar in shape to the original 2PCF f +) modified in 
a way as to obey Eq. (41). However, £ E has a very different 
shape than In fact, it is easy to see from Eqs. (3) and (4) 
that^(# min) = ^(■& ain ), £ E (# max ) = ^OW). In Appendix C, 
we show how these new pure-mode correlation functions are 
related to the original 2PCFs. As is obvious from their defi- 
nition, these pure-mode correlation functions can be obtained 
from the 2PCFs over a finite interval, hence their estimation 
does not require extrapolations or 'inventing data'. 



4. Likelihood analysis 

We calculate the posterior likelihood in the £2 m -<x 8 parameter 
space for four cases of COSEBIs ()? max = 400', # max = 20', 
each for T X ° g and T^ n ). Note that, unless stated otherwise, we 
choose ff min = 1 ' as the minimum separation in the 2PCF. For 
each of the four cases we are interested in two main questions: 
First, how does the information content evolve when including 
more modes n in the likelihood analysis? Second, once it satu- 
rates, how large is the difference to the information content of 
the 2PCFs? 



4.1. Model choice 

In the likelihood analysis we assume a flat universe, and vary 
the matter density Q m (and simultaneously £2a = 1 - Q. m 
to preserve flatness) and the normalization cr 8 of the density 
fluctuations; all other parameters are held fixed, i.e. the di- 
mensionaless Hubble constant h = 0.73, the density param- 
eter in baryons Ob = 0.044, and the slope of the primordial 
fluctuation power spectrum n s = 1.0. We choose O m = 0.27 
and <x 8 = 0.78 as our fiducial model which enters the likeli- 
hood analysis in this section and represents the cosmological 
model used in Fig. 7. The B-mode power spectrum is set to 
zero, Pb({) = 0, whereas the shear power spectra Pe are ob- 
tained from the three-dimensional density power spectra Pg us- 
ing Limber's equation (see, e.g., Kaiser 1998). The power spec- 
trum Pg is calculated with the transfer function from Efstathiou 
et al. (1992). For the non-linear evolution we use the fitting 
formula of Smith et al. (2003). In the calculation of Pe we 
choose a redshift distribution of source galaxies similar to that 
of Benjamin et al. (2007), 



n(z) = 



zoT((l + a)//3)\zo 



exp 



(43) 



with a = 0.836, p = 3.425, zo = 1-171. The correspond- 
ing 2PCFs are calculated from Eq. (7), and from these, the 
COSEBIs are calculated according to Eq. (1) for various modes 
n using linear and logarithmic filter functions. The covari- 
ances used in our likelihood analysis are calculated from the 
power spectrum P E as described in Joachimi et al. (2008), as- 
suming our fiducial cosmology. This method does not account 
for the non-Gaussianity of the shear field or the cosmology- 
dependence of the covariance (Eifler et al. 2009), however 
these issues are not crucial for our purpose as we are only inter- 
ested in the relative performance of COSEBIs and the 2PCFs. 
More important is that we can choose an arbitrary binning in 
the 2PCF covariance. The latter aspect in combination with the 
speed of the calculation is decisive to resolve the numerical 
issues in the calculation of the COSEBIs' covariance. The sur- 
vey parameters read A = 170 deg 2 , n ga i = 13.3/arcmin 2 , and 
cr e = 0.42, and correspond to those of the upcoming cosmic 
shear analysis of the full CFHTLS survey area. 

The exact method to calculate the posterior likelihood from 
the data vectors and covariances is described in Eifler et al. 
(2010). Similar to their analysis we assume flat priors inside 
the intervals Q m 6 [0.01; 1.0] and cr 8 e [0.4; 1.4], and zero 
prior otherwise. 
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Fig. 9. Likelihood contours 
for a fiducial cosmic shear 
survey, with parameters de- 
scribed in Sect. 4.1. The up- 
per (lower) six panels corre- 
spond to i? max = 400' (20'). 
Shown in the first and third 
rows are the likelihood as 
obtained from the COSEBIs 
with linear filter functions 
and various « max , in the sec- 
ond and fourth rows the 
likelihood as obtained from 
the logarithmic filter func- 
tions, and in comparison, 
we show the likelihood ob- 



o.o 0.2 o.4 o.6 o.8 1.00.0 o.2 o.4 o.6 o.a 1.00.0 0.2 o.4 o.6 o.8 i.o tained from the shear two- 
Am A m A m point correlation functions 



4.2. The covariance of the COSEBIs 

In Fig. 8 we have plotted the correlation matrix of the 
COSEBIs, defined as 

C E 



- mn 



/rE ce 9 

V mm nn 



(44) 



for several values of m, using both linear (upper panel) and log- 
arithmic (lower panel) weight functions. The value of m can be 
identified as the point where r mn = 1. For the linear weight 
functions, we see that the correlation matrix declines quickly 
for n + m, reaches a (negative) minimum at n — m + 2, and 
essentially is zero for \m - n\ > 4. Thus, the covariance ma- 
trix is in essence a band matrix. For the logarithmic COSEBIs, 
the non-zero correlations between the E„ span a larger range in 
\m - n\. One therefore expects that the inversion of the covari- 
ance matrix for a given « max is more difficult for the logarithmic 



COSEBIs than for the linear ones. However, as we will show 
below, a smaller number of logarithmic COSEBIs are needed to 
extract all the cosmological information contained in the shear 
correlation functions, compared to the linear COSEBIs. 



4.3. Figures of Merit: a short discussion 



In order to illustrate the information content one usually calcu- 
lates the so-called credible regions, inside of which the true set 
of parameters is located with a probability of e.g. 68%, 95%, 
99.9%. Instead of showing likelihood contours for all cases 
considered, we use two different measures to quantify the size 
of these credible regions, where each measure characterizes the 
information contents through a single number. 
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Fig. 10. The values of q - see Eq. (46) - 
as a function of the maximum mode n„ 



calculated from the COSEBIs for the case of linear (circles) or logarithmic (triangles) r„-functions, 
a which was included in the likelihood analysis. The results in the left (right) panel correspond to 



#max = 400' (#max = 20'), and the filled symbols are calculated for # min = 1'; in the left panel, we also plot corresponding results for # min = 2', 
indicated by the open triangles. The dashed (dash-dotted) line represents the optimal q for = 1' (i? m j n = 2'), obtained when using the 2PCFs 
directly. The dotted lines shows the asymptotic value of q achieved for large « max 



The first measure, q, is calculated from the determinant of 
the second-order moment of the posterior likelihood p(n\d), 



Qij = J d 2 7tp(7l\d) (m - n])(nj - n)) , 



(45) 



where 7r, are the parameters of the model, and 7r* are the pa- 
rameters of the fiducial model (here, i — 1,2, corresponding to 
O m and cr 8 ). We quantify the size of the credible region by the 
square root of the determinant of Q, 
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U \2- 



(46) 



Smaller credible regions in parameter space correspond to 
smaller values of q. In this paper, all q's, are given in units of 
1(T 4 . 

Our second figure of merit is obtained from the Fisher in- 
formation matrix (Tegmark et al. 1997) 



Fy^frfc-^C^Cj + C^My] ; 



(47) 



where subscripts separated by a comma denote partial deriva- 
tives with respect to 7T/, and My = E^E'j + EyE' ,-, where 
E is the n max -dimensional vector of the first « max E n 's, The 
«max X «max -dimensional covariance matrix C has the elements 
Cj„,, as given in Eq. (8). We consider a constant covariance 
in parameter space, so that the first term of Eq. (47) vanishes. 
Since the Fisher matrix is the Hessian of the (negative of the) 
log-likelihood function at its maximum, its elements describe 
the size and shape of ellipses of constant likelihood near the 
maximum. If the likelihood was strictly Gaussian, the Fisher 
matrix would completely describe its functional form. We de- 
fine our second figure of merit / as 



/ = 



1 



Vdet(F) 



(48) 
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Fig. 11. The q of the COSEBIs as a function i? max , for & mm = 1'. The 
COSEBIs are calculated from r,', og , where n ranges from 1-10 



For a better comparison with q we chose to modify the more 
commonly used figure of merit definition (see e.g., Albrecht 
et al. 2006) - we consider the area of the error ellipse itself, 
not its inverse. Similar to q, f is given in units of 10~ 4 . With 
the definition (48), q and / give the same result if (1) the like- 
lihood in the parameter space considered is Gaussian and (2) 
if the likelihood outside the region where we set a flat prior is 
negligible. We note that / and q can be significantly different 
if these two assumptions are not satisfied. Then, the Gaussian 
defined by the Fisher matrix is only a useful approximation 
close to the fiducial model, and the resulting values of / can 
be rather bad approximations for q. In contrast, q is sensitive to 
parameter regions far from the fiducial model and we therefore 
consider q as the more useful measure for the information con- 
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tents. In order to give an impression of the meaning of different 
q and / we show a sample of likelihood contours in Fig. 9 - it 
is obvious that the likelihood function in our case is far from 
Gaussian. 

4.4. Results of the likelihood analysis 

Figure 10 shows the values of q for the case of # max = 400' 
(left panel) and for § m3X = 20' (right panel). The triangles cor- 
respond to the COSEBIs from t]° s , whereas the circles corre- 
spond to the COSEBIs calculated using the linear T„. For com- 
parison we show the information content of the 2PCF (dashed 
line), which serves as an upper limit on the information con- 
tent of any second-order E/B-decomposing measure - since the 
2PCFs contain all information from second-order shear mea- 
surements and the COSEBIs are derived from them (Eifler et al. 
2008). 

For i? max = 400' the minimum value of q obtainable from 
the COSEBIs - and thus the available information on the two 
cosmological parameters considered - is extremely close to 
that obtained from the 2PCFs. The logarithmic E„ reach this 
threshold already for n max = 5, whereas the linear E„ saturate 
around n max = 25, indicating that the logarithmic modes cap- 
ture the bulk of cosmological information in significantly fewer 
data points compared to the linear case. This property can be 
particularly important in higher-dimensional parameter spaces, 
where data compression and computing time become impor- 
tant. 

The COSEBIs for # max = 20' saturate much earlier; the in- 
formation content of E„ is hardly increased when going beyond 
n — 4 {n — 3 for the logarithmic weight functions). More im- 
portant, however, is the large difference between the saturation 
limit of the COSEBIs and the corresponding information con- 
tent of the 2PCFs (which is also seen in the likelihood contours 
of Fig. 9). Obviously, the choice of # max has a significant im- 
pact on the information content, and on the relative information 
contained in the COSEBIs and the 2PCFs. 

This latter difference is not due to a deficiency of the 
COSEBIs - since they form a complete set of E-/B-mode mea- 
sures, they contain all the information that can uniquely be split 
into the two modes. If, however, one assumes that the shear 
field has no B-mode contribution, and thus using of the full 
2PCFs obviously yields tighter parameter constraints. But, this 
assumption will hardly be justifiable in any of the forthcoming 
surveys. The fact that the measured B-modes are compatible 
with zero within the error bars in a data set is not a justification 
- since any realistic survey may contain B-modes which cannot 
be identified as such, for example a uniform shear field which 
can either be E- or B-mode. Therefore, the loss of information 
due to a clean mode separation is inevitable, but a small price 
to pay relative to a potential bias of results due to undetected B- 
modes. Fortunately, for surveys which allow shear correlation 
measurements on large angular scales, this information loss is 
seen to be almost negligible. 

We analyse this more closely in Fig. 11, where we show 
q as a function of # max ; here we use logarithmic weight func- 
tions with 10 modes, i.e., where the asymptotic limit is well 



Table 1. Values of q and / as obtained by considering the full 2PCFs, 
and by using the COSEBIs E Ym , and £ log 
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2PCF 
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2' 
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400' 
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25.46 
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11.91 



achieved. The amount of information increases significantly 
when going from 20' to 100' and becomes almost constant 
when going to larger # max . This behavior, of course, depends on 
the parameter space considered; for £l m -o~% it can be understood 
from the functional behavior of the power spectrum. For small 
i, it is almost fully degenerate in these two parameters, hence 
going to larger angular scales does not yield significantly more 
information - this will be different for other parameter combi- 
nations. One also sees that the difference in information content 
between the COSEBIs and the 2PCFs decreases for larger # max 
- the larger # max , the smaller is the contribution of modes to 
the 2PCFs which can not be uniquely decomposed into E/B- 
modes. Furthermore, we have plotted the corresponding values 
of q for the aperture dispersion (M^ff)^, where 9 = # max /2 is 
the aperture radius which is calculated from the shear 2PCFs 
for ■& < i? max . Values for (M^ff)^ are calculated and plotted 
only for 6 > 40', to limit the bias caused by the lack of mea- 
sured correlation functions for ■& < & m \ n (see Kilbinger et al. 
2006) to < 5%. We see that its information content is smaller 
than that of the COSEBIs, as must be the case, owing to the 
completeness of the latter. 

Figure 12 shows a similar analysis based on /. The results 
confirm our foregoing findings. Similar to the case of q, the 
Fisher matrix analysis shows that the logarithmic E„ reach the 
saturation limit much earlier than the linear E„ and again, the 
saturation limit for # max = 400' is closer to the optimal infor- 
mation content than for # max = 20'. 

Note that in Figs. 10 and 12 we choose a similar scale for 
the vertical axis in the right and the left panels to enable for an 
easier comparison between the different cases of ?? max . We point 
out the good agreement between the saturation limits of E„ and 
e]° s in all cases, which shows that our results are numerically 
robust. In Table 1, we have listed the values of q and / as shown 
in Figs. 10 and 12 for the maximum number « max of modes. 
The small difference between these values as obtained from 
the linear and logarithmic weight functions for the COSEBIs is 
due to the fact that for these values of « max , the linear ones have 
not yet reached their full asymptotic value. 

The underlying reason why the formal loss of information 
of the COSEBIs, relative to the full 2PCFs, is larger for smaller 



Peter Schneider et al.: COSEBIs: Extracting the full E-/B 



-mode information from cosmic shear correlation functions 



13 



linear T n (d min = 1 ') 
logarithmic T n (i3 min = 1 ') 
2PCF (* mln = 1') 
logarithmic T n (d m)n = 2') 
2PCF (d min = 2') 



A-AAAAAA- ■* • • « « . 





• using linear T n 
a using logarithmic T n 
2PCF (optimal case) 


• 

• 

• 

a a . : • • • 

A. * A... A A 


X XX : i i 





10 



15 

mode n max 



20 



25 



30 



10 



12 



14 



mode n„ 



Fig. 12. The value of / - see Eq. (48) - for the case of linear (circles) or logarithmic (triangles) r„-functions as a function of the maximum 
mode rc max which was included in the likelihood analysis. The results in the left (right) panel correspond to i? max = 400' (i? raax = 20'), and the 
filled symbols are calculated for i? rain = 1'; in the left panel, we also plot corresponding results for # rain = 2', indicated by the open triangles. 
The dashed (dash-dotted) line represents the optimal / for # m j n = 1' (i? m i n = 2'), obtained when using the 2PCFs directly. The dotted lines 
shows the asymptotic value of / achieved for large n mllx 



$max is due to the filter functions that relates the 2PCFs to 
the power spectrum. This filter function is Jq(x) for the case 
of £+{ft), i.e. a function that tends towards +1 for small argu- 
ments. This implies that the correlation function is sensi- 
tive to long-range modes, i.e., modes of small (. In particular, 
this means that £ + is also sensitive to the power spectrum for 
modes satisfying 6 < 2n:/ft max , corresponding to scales which 
are in fact not probed by the 2PCFs directly - and for which no 
E-/B-mode separation is possible from the data. The relative 
cosmological information content of the power spectrum in the 
ranges I < 27i/ft m . dx and 2n/ft m . dx < I < 2n/ft m i n decreases 
with increasing ft m3X , which explains the difference in 'relative 
information loss' in Figs. 10 through 12. 

Up to now we have always chosen = 1'. However, 
one may ask whether cosmic shear measurements down to this 
angular scale can be compared to sufficient accuracy with cos- 
mological predictions, since at the corresponding length scales, 
baryonic physics can have a significant influence on the pro- 
jected power spectrum. Of course, modeling the behavior of 
baryons in a cosmological simulation is much more difficult, 
and burdened with higher uncertainty, than dark matter-only 
simulations. Jing et al. (2006) compared pure dark matter sim- 
ulations with hydrodynamic simulations to conclude that for 
( ~ 10 4 , corresponding to ft ~ 1', the predicted power spectra 
differ by about 10% - much more than the predicted statistical 
uncertainty of future cosmic shear surveys. 

Fortunately, the largest effect of baryons on the total mass 
distribution seems to be a change of the halo concentration 
parameter as a function of halo mass (Rudd et al. 2008), in 
that baryons render halos more concentrated. If this is the 
case, then this effect can be calibrated from the weak lens- 
ing data themselves. Zentner et al. (2008) studied such a self- 
calibration method for future surveys and concluded that the 
concentration-mass relation can be determined from the weak 



lensing data. In the framework of the halo model for the large- 
scale structure, the power spectrum can then be calculated us- 
ing this modified concentration-mass relation, and fairly accu- 
rate model predictions can be made. 

Dropping the small angular scales from future surveys im- 
plies considerably weaker cosmological constraints. In the left 
panels of Figs. 10 and 12, we have plotted the values of q 
and /, respectively, for surveys with ft m [ n = 2'. Independent 
of whether the 'optimal' constraints from the 2PCFs or the 
COSEBIs are employed, the resulting constraints are weaker 
than for i? m j n = 1'. Therefore, it is of considerable interest to 
improve the accuracy of predictions for the matter power spec- 
trum to small scales, to make full use of the information con- 
tained in cosmic shear surveys on small angular scales. 



5. Summary and discussion 

We have defined pure E- and B-mode cosmic shear measures 
from correlation functions over a finite interval ft m i n < ft < 
#max- These are complete orthonormal sets of such measures, 
implying that they contain all cosmic shear information in the 
two-point correlation functions which can be uniquely split into 
E- and B-modes. For these COSEBIs, we have calculated their 
relation to the underlying power spectrum and their covariance 
matrix. Two different sets of COSEBIs have been explicitly 
constructed, those with weight functions which are polynomi- 
als in the angular scale, and those with polynomial weight func- 
tions in the logarithm of the angular scale. For the former case, 
analytic expressions were obtained for all orders, whereas in 
the logarithmic case, a linear system of equations needs to be 
solved numerically. 
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5.1. Advantages of the COSEBIs 

Comparing the COSEBIs with earlier cosmic shear measures, 
we point out a number of advantages. First, using the correla- 
tion functions themselves does not provide an E-/B-mode sep- 
aration. The construction of E-/B-mode correlation functions 
as described in (Crittenden et al. 2002) requires knowledge of 
the correlation functions over an infinite angular range, and is 
therefore not applicable in practice (extrapolating to infinite 
separation using fiducial cosmological models corresponds to 
'inventing data', and implicitly assumes that there are no long- 
range B-modes). In fact, the generalization of pure E-/B-mode 
correlation functions based on data over a finite angular range 
has been derived here (see Sect. 2 and Appendix C); however, 
we expect these to be of limited use in practice. 

Whereas the aperture mass dispersion (Schneider et al. 
1998) provides a clean separation into E- and B-modes 
(Crittenden et al. 2002; Schneider et al. 2002), it requires the 
knowledge of the correlation function to arbitrarily small an- 
gular separation. There are at least two aspects which render 
this impractical: first, galaxy images need a minimum separa- 
tion for their shapes to be measurable. Second, on very small 
scales baryonic effects will affect the power spectrum and ren- 
der model predictions very uncertain. The inevitable bias of 
the aperture mass dispersion (Kilbinger et al. 2006) motivated 
the ring statistics (Schneider & Kilbinger 2007). The latter re- 
moves the bias, depends only on the correlation function over 
a finite interval, and has potentially higher sensitivity to cos- 
mological parameters (Eifier et al. 2010, FK10). However, the 
weight function of the ring statistics is largely arbitrary. 

The COSEBIs contain all available mode-separable infor- 
mation from the correlation functions on a finite interval, and 
are therefore guaranteed to provide highest sensitivity to cos- 
mological parameters. Furthermore, they form a discrete set of 
measures, whereas the other cosmic shear statistics include a 
somewhat arbitrary grid of variables, like the outer scale of the 
ring statistics: if the grid is too coarse, information gets lost, 
whereas a finer grid renders the measures largely redundant, 
implying large and significantly non-diagonal covariances. In 
contrast, the discreteness of COSEBIs leaves no freedom, and 
for the linear weight functions, the covariances have a nar- 
row band structure. The information clearly saturates after a 
number of modes, and this number is surprisingly small for 
the logarithmic weight function. Therefore, determining co- 
variance matrices from numerical simulations (as was done for 
the COSMOS analysis of Schrabback et al. 2009) appears con- 
siderably simpler than for other cosmic shear measurements, 
which is particularly true for an unbiased estimate of their in- 
verse (see Hartlap et al. 2007, for a discussion of this point). 
Based on these properties of the COSEBIs, we would like to 
advertise them as the method of choice for future cosmic shear 
analyses. 

5.2. Generalizations 

In case photometric redshift information of the lensed galaxies 
is available and several source populations can be defined based 
on their redshift estimates, the COSEBIs can be generalized to 



a tomographic version. Furthermore, under the same assump- 
tion, intrinsic alignment effects between the tidal gravitational 
field and the intrinsic galaxy orientation (e.g., Catelan et al. 
2001; Crittenden et al. 2001; Jing 2002; Hirata & Seljak 2004) 
can be filtered out by properly choosing redshift-dependent 
weight functions, such as to avoid physically close pairs of 
galaxies (King & Schneider 2002, 2003; Heymans & Heavens 
2003) or make use of the specific redshift dependence of the 
shear-intrinsic alignments (Bridle & King 2007; Joachimi & 
Schneider 2008, 2009), possibly in combination with other data 
(Joachimi & Bridle 2009). We expect that these generalizations 
of the COSEBIs provide no real difficulties. 

It would be desirable to obtain a similar measure for third- 
order cosmic shear statistics, i.e., one that provides clear E-/B- 
mode separations from three-point correlation functions mea- 
sured over a finite interval. Up to now, the aperture statistics 
is the only known such measure (Jarvis et al. 2004; Schneider 
et al. 2005); however, similar to the case of the aperture dis- 
persion, third-order aperture statistics requires the correlation 
functions to be measured down to arbitrarily small separations. 
A generalization of the COSEBIs to third order seems challen- 
ing - not only because of the higher number of independent 
variables (the three-point correlation functions depend on three 
variables) and the larger number of modes (one pure E-mode, 
one mixed E/B-mode, and two further modes which are not 
invariant under parity transformation), but also because of the 
more complicated relation between correlation functions and 
the bispectra (Schneider et al. 2005). Thus, even the analogue 
of the starting point of the current paper - Eqs. (1) and (3) - is 
not yet known for the third-order case. 
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Appendix A: Calculation of the COSEBIs: 
Numerical problems and solutions 

Several numerical issues arose during the implementation of 
the calculations of the COSEBIs, especially in the context of 
their covariance. As these issues are crucial for obtaining the 
correct values of q and /, we outline them in greater detail. We 
employ the QAG adaptive integration routine from the GNU 
Scientific Library 1 and obtain the E„ using two different meth- 
ods. First, we calculate them from the set of 2PCFs according 
to Eq. (1) and second, we check for consistency by calculating 
E„ directly from the Pe according to Eq. (5). The first method 
cleanly separates E- and B-modes, giving a B-mode residual 
due to numerical uncertainties which is 8 to 5 orders of mag- 
nitudes lower than the E-mode, depending on the scales con- 
sidered and whether one uses T„ or T^° g . Both methods yield 

1 http://www.gnu.org/software/gsl/ 
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results in perfect agreement, hence we are confident that there 
are no numerical problems in either of them. 

When using a binned version of the 2PCF instead, we find a 
non-negligible deviation when using too few angular bins. The 
number of bins, above which the E/B-decomposition becomes 
stable, depends on the mode E n , the maximum scale § mdK of the 
2PCF, and whether one uses T n or T X ° g . This should be checked 
carefully before applying the method to an actual data set. As 
an example, we found that for linear T„ and # max = 400', one 
needs ~ 10 5 bins to calculate £30 properly and to have an ac- 
curate mode separation. 

The calculation of the covariance C\ n is numerically more 
challenging than that of the data vectors. Again, we use two 
approaches and calculate C E from Pe using Eq. (8), and from 
the 2PCF covariance using Eq. (9). Both methods have their 
difficulties and need to be checked carefully for consistency 
before using the covariance in the likelihood analysis. 

The power spectrum approach using Eq. (5) involves the 
calculation of a one-dimensional integral over Pe multiplied by 
two filter functions W± . As can be seen from Figs. 2 and 6, these 
filter functions are strongly oscillating, which becomes worse 
for large 6 and higher modes n. We use a stepwise integration 
to calculate the integral and truncate the integral once the ratio 
of "new contribution in step ; / integral calculated until (; - 1)" 
drops below a certain threshold. We vary the width of the steps 
as well as the truncation threshold; however, we find that the 
integration becomes inaccurate when going to higher modes n. 

For calculation of C E „, from the covariance of the 2PCFs 
we find that it is too time-consuming to calculate the 2PCF 
covariance for every sampling point of the integration routine 
separately. Instead, we calculate the 2PCF covariance for a spe- 
cific binning and interpolate the values during the integration. 
We use a linear binning in the 2PCF covariance for the linear 
weight function and a logarithmic binning for the case of T l ° s . 
In addition, we check how strongly the number of bins influ- 
ences the accuracy of the integral, finding that we can calculate 
C E properly if we choose at least 1000 x 1000 bins in the 2PCF 
covariance. The final C E must be symmetric, positive definite, 
and not ill-conditioned, as already small deviations from these 
requirements can bias the information content measures q and 
/■ 



The noise N of E is obtained through the covariance of the E n , 



N 2 = (E 2 ) - (E) 2 = J] a m a n C 

m,n=l 

yielding as signal-to-noise ratio 

S ^ Zun ®n^*n 

N ~ 



E 

mn 



(B.3) 



(B.4) 



To obtain a maximum of S /N with respect to the coefficients 
a n , we differentiate the foregoing expression with respect to a 
coefficient a^, 



_d_S 

da k N 



2N 3 



= N" 



N 2 E k - 



Yc 



\ n 



Setting this derivative to zero results in 



Ei = 



Yc 



(B.5) 



(B.6) 



From this equation we see that the overall amplitude of the a n 
cannot be determined, i.e., if the a„ are a solution, then Aa n 
solve the equation as well. Noting that the first term on the 
r.h.s. of Eq. (B.6) does not depend on k, a solution is obtained 

as 

n 

as can be also verified by inserting this into Eq. (B.6). Thus, 
if the function T+ is expanded into a set of functions which all 
satisfy the constraints (4), the signal-to-noise maximization can 
be done analytically. If different sets of functions are used for 
constructing the T + maximizing the S/N, the resulting function 
should be the same in the limit N — > 00; however, different 
sets of functions may require different N before the asymptotic 
limit is reached. 



Appendix B: S/N maximization 

From a complete set of functions T + „ obeying the constraints 
(4) for given -& mm and i? max , we can find a weight function T+(-&) 
which maximizes the signal-to-noise of the E-mode. This prob- 
lem was also considered by FK10. In this case, we can write 



Z" + (tf) = J] On T + „(§) , 



(B.l) 



which satisfies the integral constraints (4) for any choice of the 
a„. Then the E-mode signal is, in the absence of B-modes, 



(B.2) 



Appendix C: Pure E-/B-mode correlation functions 

We will now explore how the pure-mode correlation functions 
introduced in Eq. (40) are related to the original £ ± . For this, we 
use Eq. (1) in the definition (40) to obtain 



^' B (#) = 



j CO 

CO 



«=i 

dcp ip 



(CI) 



where fi = +1 for E-modes, // = -1 for the B-modes, and 
where we defined the functions S ±± (§,tp) in the last step. 
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These functions are calculated next, by noting that the normal- 
ized Legendre polynomials p n (x) as defined in Eq. (19) are or- 
thonormal, 



n=0 
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Owing to symmetry, 

oo 

s-+(x,y) = ^ t-„(x) U„(y) = s+-(y, x) , (C.9) 
n=i 

and 

oo 

s—(x,y) = 2^ t- n (x) t- n (y) 

= s ++ (x,y) + K(y-xMx,y) (CIO) 
+ H(jc - y)Q(y, x) - V(x, y) - V(y, x) + W(x, y) , 
where the symmetric function W is defined as 



dx p„(x) p m (x) = 6„ 



form a complete set of functions on the interval [—1,1], and 
therefore obey 



^ Pn(x) p„(y) = S D (x - y) 



(C2) 



Noting that we have chosen in Sect. 3.1 t n (x) = p n +\(x) for 
n > 3, we find that 

s++(x,y) = ^ t +n (x) t +n (y) 

n=\ 

oo 2 



Xmin(x,y) 
dz0(z,x)0(z,y) 



J>I> 



F ++ (z,z')0(z,x)g(z',y) 



(C.ll) 



n=4 

3 2 

= <5 D (* - y) -^Pn(x) p„{y) +^ f+nW f+n(v) 
n=0 «=1 

=:(5 D (x-y)-,F ++ (x,;y) , 



Thus, we find for the S ±± (ft, <p) in turn, using x - 2(ft - ft) /Aft 
andy = 2(ip-ft)/Aft: 

S ++ (ft,<p) = — 8 D (ft-<p)-F ++ (x,y), 



S + -(ft,<p) 



■ 5 D (ft -if)- F + _(x, y) + H( V - ft) Q(x, y) 



(C.3) S -+{■&, ip) = S+-(ip,ti), 



(C12) 



where in the final step we have defined the function F ++ (x,y), 
which is obviously symmetric in its arguments. The explicit 
expression for it reads 



F++(x,y) = 



5(1 +Bx)(l +By) 



(C4) 



8(175 + 35fi 2 + 45 B 4 + B 6 ) 
x (l40(l + 3xy) + 70B(3xy - 5)(x + y) 
+ 7B 2 [39 + 20xy - 25(x 2 + y 2 ) + 15x 2 y 2 ] 
+ 14fi 3 (5xy- 3)(jc + y) + B 4 [15 - 21(x 2 + y 2 ) + 35x 2 y 2 ]\ . 

We can now calculate the other sums in Eqs. (C.l), making use 
of Eq. (20) written in the form 



S __(#, <p) = ^ 6 D (ft -<p) + 1% - ft) g(x, y) 

+ H(#-<p)g(y,x)-F-(x,y), 

with F+-(x,y) = F ++ (x,y) + V(x,y), F__(x,y) = F ++ (x,y) + 
V(x, y) + V(y, x) - W(x, y). We finally obtain for the pure mode 
correlation functions 

+/£_(#) 
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dip ip 



+ (it-(<p)F + - 



t + (<p) F + 



2(ft-ft) 2{ip-ff) 
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£■ 



+ P- 



t-„(x) = t + „(x) + I dz t +n (z) Q{z, x) 



with 



0(z,x) = 



45 



(1 +Bx) 2 
This then yields 



1 +Bz- 



3(1 +Bz) 3 



(l+Bx) 2 
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(C.13) 



= 6 D (x -y)- F ++ (x, y) + H(y - x) £(x, y) - V(x, y) , 



where 



V(x,y) 



dzF ++ {x,z)G(z,y) 



(C.8) 



Hence, pure mode correlation functions can be obtained 
from the observed correlation functions over a finite interval. 
However, we believe that these pure mode correlation func- 
tions are of little practical use, since for a quantitative analy- 
sis of cosmic shear surveys the COSEBIs contain all relevant 
information. 
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